Cellular-phone based speech-to-speech translation system ATR-MATRIX

نویسندگان

  • Rainer Gruhn
  • Harald Singer
  • Hajime Tsukada
  • Masaki Naito
  • Atsushi Nishino
  • Atsushi Nakamura
  • Yoshinori Sagisaka
  • Satoshi Nakamura
چکیده

We describe the implementation of a cellular-phone based speech translation system without telephone quality speech database or special CT hardware. The purpose is to quickly build a prototype service system that can be used for data collection with real users. To train the acoustic model for the speech recognition system, available high-quality databases were made usable by 1.) appropriate downsampling and ltering of high-quality databases, and 2.) by piping, similar to the NTIMIT and CTIMIT paradigms. An evaluation of acoustic models with ltered, piped and real cellular-phone data is given. Recognition rates are at same levels as for wideband speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of the ATR-matrix speech translation system with a pair comparison method between the system and humans

The main goal of the present paper is to propose a new scheme for the overall evaluation of a speech translation system that supports the design of target application systems and determines their performance. Evaluations are conducted on the Japanese-toEnglish ATR-MATRIX speech translation system, which was developed at ATR Interpreting Telecommunications Research Laboratories. In the proposed ...

متن کامل

End-to-end evaluation in ATR-MATRIX: speech translation system between English and Japanese

ATR Interpreting Telecommunications Research Laboratories developed ATR-MATRIX speech translation system, which translates both ways between English and Japanese, enough to hold natural on-line real-time conversations. Using this system we started an end-to-end evaluation of a speech translation system through a dialog test with naive speakers who are not involved in system development and not ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Solutions to Problems Inherent in Spoken-language Translation: The ATR-MATRIX Approach

ATR has built a multi-language speech translation system called ATR-MATRIX. It consists of a spoken-language translation subsystem, which is the focus of this paper, together with a highly accurate speech recognition subsystem and a high-definition speech synthesis subsystem. This paper gives a road map of solutions to the problems inherent in spoken-language translation. Spokenlanguage transla...

متن کامل

A Japanese-to-English speech translation system: ATR-MATRIX

We have built a new speech translation system called ATR-MATRIX (ATR's Multilingual Automatic Translation System for Information Exchange). This system can recognize natural Japanese utterances such as those used in daily life, translate them into English and output synthesized speech. This system is running on a workstation or a high-end PC and achieves nearly real-time processing. The current...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000